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We study the convergence properties of the conditional (Kullback-Leibler) entropy in stochastic 
^ ' systems. We have proved very general results showing that asymptotic stability is a necessary and 

sufficient condition for the monotone convergence of the conditional entropy to its maximal value 
of zero. Additionally we have made specific calculations of the rate of convergence of this entropy 
to zero in a one-dimensional situations, illustrated by Ornstein-Uhlenbeck and Rayleigh processes, 
higher dimensional situations, and a two dimensional Ornstein-Uhlenbeck process with a stochas- 
. tically perturbed harmonic oscillator and colored noise as examples. We also apply our general 

O ' results to the problem of conditional entropy convergence in the presence of dichotomous noise. In 

ecu ■ 
both the single dimensional and multidimensional cases we are to show that the convergence of 
the conditional entropy to zero is monotone and at least exponential. In the specific cases of the 
Ornstein-Uhlenbeck and Rayleigh processes as well as the stochastically perturbed harmonic oscil- 
' lator and colored noise examples, we have the rather surprising result that the rate of convergence 

of the entropy to zero is independent of the noise amplitude a as long as cr > 0. 
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I. INTRODUCTION 



This paper examines the role of noise in the evolution of the conditional (or Kullback-Leibler or relative) entropy 
to a maximum. We were led to examine this problem because it is known that in invertible systems (e.g. measure 
^ , preserving systems of differential equations or invertible maps) the conditional entropy is fixed at the value with 
psj which the system is prepared QjIMSHj but that the addition of noise can reverse this invertibility property and lead 
On , to an evolution of the conditional entropy to a maximum value of zero. Here, we make both general and concrete 
I specific calculations to examine the entropy convergence. We carry this out by using convergence properties of the 
Fokker-Planck equation using 'entropy methods' [3, which have been known for some time to be useful for problems 
involving questions related to convergence of solutions in partial differential equations 0, 0, |EM ■ Their utility 
can be traced, in some instances, to the fact that entropy may serve as a Liapunov functional [ij. 

The outline of this paper is as follows. Section Hll introduces the dynamic concept of asymptotic stability and the 
■ notion of conditional entropy. (Asymptotic stability is a strong convergence property of ensembles which implies 
\ mixing. Mixing, in turn, implies ergodicity.) This is followed by two main results connecting the convergence of the 
I ' conditional entropy with asymptotic stability (Theorem^, and the existence of unique stationary densities with the 
convergence of the conditional entropy (Theorem Section ITTll shows that asymptotic stability is a property that 
C cannot be found in a deterministic system of ordinary differential equations, and consequently that the conditional 
^ ' entropy does not have a dynamic (time dependent) character for this type of invertible dynamics. Section llVl considers 
the stochastic extension where a system of ordinary equations is perturbed by Gaussian white noise (thus becoming 
non-invertible) and gives some general results showing that in this stochastic case asymptotic stability holds. Section 
IIV Al starts to consider specific one dimensional examples, and we show that the conditional entropy convergence to 
zero is at least exponential. We consider the Ornstein-Uhlenbeck process in Section FlV A II and a Rayleigh process 
d ' in Section FlV A 21 We consider multidimensional stochastic systems in Section FlV Bl showing that the exponential 
convergence of the entropy still holds. Specific examples of these higher dimensional situations are considered within 
the context of a two dimensional Ornstein-Uhlenbeck process in Section llV Cl with specific examples of a stochastically 
perturbed harmonic oscillator (Section |IV C 1|) and colored noise (Section |IV C 2ll as examples. Section Ivl applies our 
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general results to the problem of conditional entropy convergence in the presence of dichotomous noise. The paper 
concludes with a short discussion. 

II. ASYMPTOTIC STABILITY AND CONDITIONAL ENTROPY 

Let (X, be a cr-finite measure space. Let {P*}t>o be a semigroup of Markov operators on L^{X,ii), i.e. 
P*f > for / > 0, / P*/(x) n{dx) = J f{x) fi{dx), and = P*(P^/). If the group property holds for t, s € R, 

then we say that P is invertible, while if it holds only for t, s e R"*" we say that P is non-invertible. We denote 
the corresponding set of densities by /i), or V when there will be no ambiguity, so / G I? means / > and 

Jx fi^) ^-idx) — 1. We call a semigroup of Markov operators P* on L^{X, /i) asymptotically stable if there is a density 
/* such that P*/* = /* for alH > and for all densities / 

lim P*/ = 

t — *oo 

The density /* is called a stationary density of P*. 

We define the conditional entropy (also known as the KuUback-Leibler or relative entropy) of two densities /, g e 
'D{X,fj.) as 

Hc{f\g)^~ I f{x)\ogl^f,{dx). (2.1) 

Our first result connects the temporal convergence properties of He with those of P*. 

Theorem 1 Let P* he a semigroup of Markov operators on (X, /i) and /* he a positive density. If 

lim H,{P'f\f,) = (2.2) 

t — *-oo 

for a density f then 

limP*/-/*. (2.3) 

t — >oo 

3t 



Conversely, if P f* — /* for all t > and Condition \2.!i\ holds for all f with Hc{f\f*) > —oo, then Condition 
holds as well. 

Proof. To prove the first part, note that for densities f,g GV, we have from the Csiszar-KuUback inequality 

-\\f^g\\h>2H,if\g). 

In particular, 

-II^V-Mlii >2i?c(P7l/*). 

Thus if 

lim i?c(P*/l/*) = 0, 

t — >-oo 

then 

lim ||p7-/*IUi -0 

t — >oo 

and P* is asymptotically stable. 

Proof of the converse portion is not so straightforward. Assume initially that ///* is bounded, so < / < a/,. 
Since P*/* = /* and P* is a positive preserving operator, we have < P*/ < a/,. Making use of the inequality, 
proved in the appendix, 

-HM\g) < 1(^-1) gKdx), (2.4) 
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valid for all densities /, g, we arrive at 

ptf 



Hc{P'f\g) < J 



\p'f^Mfi{dx) 



< max{l,|a-l|}||P*/-M|i. 

Consequently, Hc{P*^ f\f*) ^ as i ^ oo for every density / such that ///* is bounded. 

Now assume that -ffc(/|/*) > — C!0 and write / in the form f = ga + fa where ga = f — fa and 

f _ / 0, f{x)>af4x), 
' ~ \ fix), < fix) < af^ix). 

We have 1 = = + and ||ga||i = ^ y^^j^^ f ^i{dx) ^ as a cx). Write P*/ in the form 

fa f fa \ 

Since ° is bounded, we have He . . ° \f* I > — oo and 

Vll/a||l / 



HcijArlf*) = / flog^fiidx), 



by the first part of our proof. Observe that 

l{f>af,} f 

which, because of the condition Hc{f\f*) > — oo, is convergent to as a ^ oo. Furthermore HdP* f\f*) > Hc{f\f*) 
for any density / and the function / i-^ Hdflf*) is concave. Combining these results and equality 12. 51 we obtain 

jt r\ r- \ ^ ll„ll TJ I ut ( 9a \ \ _n \ , \ \ r \\ tj ( r,t I fa 



Letting i — > oo, followed by taking the limit a — > oo completes the proof of the i/part, and of the theorem. 

The semigroup P* on i^(X, /i) is called partially integral if there exists a measurable function q : X x X — > [0, oo) 
and to > such that 



IX Jx 

Our next result draws a connection between the existence of a unique stationary density f.^, of P* and the convergence 
of the conditional entropy He- 

Theorem 2 Let P' he a partially integral semigroup of Markov operators. If there is a unique stationary density /, 
for P* and /* > 0, then 

lim HdP'folf*) - 

r— i-oo 

for all fo with -ffc(/o|/*) > -oo. 

Proof. From Pichor and Rudnicki [T5, Theorem 2] the semigroup P* is asymptotically stable if there is a unique 
stationary density /, for P* and /* > 0. Thus the result follows from Theorem^ 



P'°fix)> / qix,y)fiy)t,idy) 
Jx 

for every density / and 

qix,y) ^lidy) fi{dx) > 0. 
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III. CONDITIONAL ENTROPY IN INVERTIBLE DETERMINISTIC SYSTEMS 



In this section we briefly consider the behaviour of the conditional entropy in situations where the dynamics are 
invertible in the sense that they can be run forward or backward in time without ambiguity. To make this clearer, 
consider a phase space X and a dynamics St : X ^ X. For every initial point a;°, the sequence of successive points 
Stix'^), considered as a function of time t, is called a trajectory. In the phase space X, if the trajectory St{x°) is 
nonintersecting with itself, or intersecting but periodic, then at any given final time tf such that x-^ = Stj:{x^) we 
could change the sign of time by replacing t by —t, and run the trajectory backward using x-^ as a new initial point 
in X. Then our new trajectory S-t{x^) would arrive precisely back at x° after a time tf had elapsed: x^ — S-tf{x^). 
Thus in this case we have a dynamics that may be reversed in time completely unambiguously. 

We formalize this by introducing the concept of a dynamical system {•S'tjjgjj^, which is simply any group of 
transformations St : X ^ X having the two properties: 1. So{x) = x; and 2. St\St'{x)) = St+t'{x) for t,t' (zH. or 
Z. Since, from the definition, for any t g R, we have St{S-t{x)) — x = S-t{St{x)), it is clear that dynamical systems 
are invertible in the sense discussed above since they may be run either forward or backward in time. Systems of 
ordinary differential equations are examples of dynamical systems as are invertible maps. 

Our first result is very general, and shows that the conditional entropy of any invertible system is constant and 
uniquely determined by the method of system preparation. This is formalized in 

Theorem 3 // P* is an invertible Markov operator and has a stationary density , then the conditional entropy is 
constant and equal to the value determined by and the choice of the initial density /q for all time t. That is, 



H,{P'fM,)=HMo\U) 

for all t. 

Proof. Since P is invertible, by Voigt's theorem 26] with g — f* it follows that 

i^c(P*+*7o|/*) = i?c(P*>*/o|/*) 

> i?c(P*/o|/*) > i?c(/o|/*) 



(3.1) 



for all times t and t' . Pick t' = —t so 



and therefore 



Hcifolf*) > H^iP'folf*) > ffc(/o|/* 



Hc{P'fo\f*)^H,ifo\f.) 

for all t, which finishes the proof. 

In the case where we are considering a deterministic dynamics S* : X —f X where X G X, then the corresponding 
Markov operator is also known as the Frobenius Perron operator |l3j , and is given explicitly by 



P'foix)^foiS-\x))\J-\x)\ 



(3.2) 



where J *(a:) denotes the Jacobian of 5 *(a;). A simple calculation shows 



Hc{P'fo\f* 



P'h{x) log 



X 



P\f oix) 



fo{S-\x))\J-'{x)\\o^ 



dx 

Us-\x)) 



X 



Ms-'ix)) 



dx 



- / My) log 
Jx 

Hcifolf*) 



/o(y) 



My) 



dy 



as expected from Theorem |31 

More specifically, if the dynamics corresponding to our invertible Markov operator are described by the system of 
ordinary differential equations 



dxi 
~dt 



F^{x) 



i = 1, . . . , d 



(3.3) 
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operating in a region of R'^ with initial conditions Xi{0) = Xi,o, then the evolution of f{t, x) = P*/o(x) is governed 
by the generalized Liouville equation 



fli ^ 



dt ^ dxi 

The corresponding stationary density /* is given by the solution of 

^ dx, 



(3.4) 



(3.5) 



Note that the uniform density /, = 1, meaning that the flow defined by Eq. 13.31 preserves the Lebesque measure, is a 
stationary density of Eq. 13.41 if and only if 



Si-- 



(3.6) 



In particular, for the system of ordinary differential equations (|3.3|) whose density evolves according to the Liouville 
equation H3.4|l we can assert that the conditional entropy of the density P*/o with respect to the stationary density 
/, will be constant for all time and will have the value determined by the initial density /o with which the system 
is prepared. This result can also be proved directly by noting that from the definition of the conditional entropy we 
may write 



log ( i V 4 - 1 



/ 



HciflU)^- I fix) 
when the stationary density is . Differentiating with respect to time gives 

dx 



dx 



dHc _ 


- / — log 


7" 


dt ~ 


Jr' dt ^ 





or, after substituting from H3.4|) for (df/dt), and integrating by parts under the assumption that / has compact 
support, 

dHc ^ 
dt ^ Jr' f. 



dxi 



However, since /* is a stationary density of P*, it is clear from H3.4|l that 

dHc 



dt 



0, 



and we conclude that the conditional entropy i?c(P*/o|/*) does not change from its initial value when the dynamics 
evolve in this manner. 



IV. EFFECTS OF GAUSSIAN NOISE 



In this section, we turn our attention to the behaviour of the stochastically perturbed system 

^ = F.(x)+^a,,(a:)0, z = l,...,d (4.1) 

with the initial conditions Xi(0) — x^^O: where <Jij(x) is the amplitude of the stochastic perturbation and = is 
a "white noise" term that is the derivative of a Wiener process. In matrix notation we can rewrite Eq. 14.11 as 

dx{t) = F{x{t))dt + Y.{x{t)) dw{t), (4.2) 

where T,(x) = [cy (a;)]ij=i,..._d. Here it is always assumed that the Ito, rather that the Stratonovich, calculus, is used. 
For a discussion of the differences see and particular, if the o-y are independent of x then the Ito 

and the Stratonovich approaches yield identical results. 

To fully interpret (|4.2|) we briefly review the properties of the ^ when they are derived from a Wiener process. We 
say that a continuous process {w{t)}tyo is a one dimensional Wiener process if: 
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1. w(0) = 0; and 

2. For all values of s and t, < s < t the random variable wit) — w{s) has the Gaussian density 



g{t- s,x) 



1 



: exp 



2(i- s) 



(4.3) 



In a completely natural manner this definition can be extended to say that the d dimensional vector 

W{t) = {Wi{t), - ■ ■ ,Wd{t)}t>0 

is a d- dimensional Wiener process if its components are one dimensional Wiener processes. Because of the indepen- 
dence of the increments, it is elementary that the joint density of w(t) is 



git, xi,---,Xd) = g{t, xi)--- g{t, Xd) 



and thus 



that 



dx = 1, 



Xig{t, x) dx — 0, 



i — 1, • • • , d 



and 



In BTIl . 



XiXjg{t, x) dx = Sijt i, j = 1, ■ • ■ , 



1 i = j 
i^j 



(4.4) 



(4.5) 



(4.6) 



(4.7) 



(4.8) 



is the Kronecker delta. Therefore the average of a Wiener variable is zero by Eq. 14.61 while the variance increases 
linearly with time according to 1)4. 7|) . 

The Fokker-Planck equation that governs the evolution of the density function f(t, x) of the process x(t) generated 
by the solution to the stochastic differential equation H4.2|l is given by 



df ^dmx)f] , 1 ^ d^a,,{x)f] 



dt ^ 



dxi 



d 



dxidx 



(4.9) 



where 



k=l 



If k{t, X, xo) is the fundamental solution of the Fokker-Planck equation, i.e. for every xq the function (t, x) <—>■ k{t, x, xq) 
is a solution of the Fokker-Planck equation with the initial condition 5{x — xq), then the general solution J{t,x) of 
the Fokker-Planck equation (|4.9|l with the initial condition 



f{x,Q) = fo[x) 



is given by 



f{t,x)= / k{t,x,xo)foixo)dxo- 



(4.10) 
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From a probabilistic point of view k{t,x,Xo) is a stochastic kernel (transition density) and describes the probability 
of passing from the state Xq at time t = to the state a; at a time t. Define the Markov operators P* by 

P'Ux)= f k{t,x,xo)fo{xo)dxo, fo^LK (4.11) 



Then P*/o is the density of the solution x{t) of Eq. 14.21 provided that /o is the density of x(0). 
The steady state density /*(a;) is the stationary solution of the Fokker Planck Eq. H4.9|l : 

J^ d[F,{x)f] 1 ' d^[a,,{x)f] 

^ dx, 2 ^ dx^dx, ~ ^ ' 

In the specific case of X — and fi equal to the Lebesque measure, we recover from Eq. 12. li the conditional entropy 

7(^,2:) 



dx (4.13) 



If the coefficients aij and Fi are sufficiently regular so that a fundamental solution k exists, and Jyjt(t, x,y) dx = 1, 
then the unique generalized solution (|4.1(J|) to the Fokker-Planck equation H4.9|l is given by Eq. 14.111 One such set 
of conditions is the following: (1) the Fi are of class with bounded derivatives; (2) the are of class and 
bounded with all derivatives bounded; and (3) the uniform parabolicity condition holds, i.e. there exists a strictly 
positive constant p > such that 

d d 

ai.j{x)Xi\j > Aj , Xi,\-j e'R,x G TC^. 

ij = l i=l 

The uniform parabolicity condition implies that k{t,x,y) > 0, and thus P*f{x) > for every density, which implies 
that there can be at most one stationary density, and that if it exists then /* > 0. In this setting, the corresponding 
conditional entropy i?c(P* /o|/*) approaches its maximal value of zero, as i — > oo, if and only if there is a stationary 
density /, that satisfies (|4.12|) . 

A. The one dimensional case 

If we are dealing with a one dimensional system, d = 1, then the stochastic differential Eq. l4.1l simDlv becomes 

^^F{x)+aix)t (4.14) 

where again ^ is a (Gaussian distributed) perturbation with zero mean and unit variance, and (j{x) is the amplitude 
of the perturbation. The corresponding Fokker-Planck equation H4.9|l becomes 

df d[F{x)f] l d^[a^{x)f] 
Tt= 9^+2 dx^ ■ 

The Fokker-Planck equation can also be written in the equivalent form 

is called the probability current. 

When stationary solutions of (|4.15|) . denoted by f*{x) and defined by Pf/, — /, for all t, exist they are given as 
the generally unique (up to a multiplicative constant) solution of H4.12(l in the case d =1: 

d[F{x)U] , \ d^[a^{x)U] 
9^+2^^5 = (^-^^^ 



where 
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Integration of Eq. I4.18l bv parts with the assumption that the probabihty current S vanishes at the integration hmits, 
followed by a second integration, yields the solution 

This stationary solution will be a density if and only if there exists a positive constant K > Q such that /, can be 
normalized. 

We now discuss rigorous results concerning the one dimensional case. Let a(x) = a^{x). Assume that a, a' , and F 
are continuous on X = (a, P), where oo < a < /3 < oo. Let a{x) > for all x G (a, P) and Xq be any point in (a, 
If we want to study the long term behaviour, as t — > oo, of the process x{t) given by Eq. 14.141 we need to know that 
the process exists for all t > 0; in other words that there is no explosion in finite time, and that it lives in the interval 
(a,/9). If, for example, a were absorbing so that with positive probability we could reach a in finite time, then a 
solution / of Eq. 14. 151 would show a decrease in norm in L^{a, (3). 

There is a relation between the behaviour of x{t) at the boundary points a and /3, and the existence and uniqueness 
of solutions of the corresponding Fokker-Planck equation as described by In particular, if the condition 



exp 



^ 2F{z) 

0^ 



dz 



dx 



exp 



2Fiz) 



dz 



dx = oo 



(4.20) 



holds, then the generalized solutions P*/ of the Fokker-Planck equation exist and constitute a semigroup of Markov 
operators on L^(a, /?). If only one of the integrals is finite then this conclusion holds as well, but we restrict ourselves 
to Condition 14.201 because it is also necessary for the existence of a stationary solution of Eq. 14.141 c.f. Pinsky [l^ . 
Thus we conclude that there is a stationary density /* if and only if Condition 14. 201 holds and 



1 



exp 



^ 2F{z) 

0^ 



dz 



dx < 



(4.21) 



j^] provide a general method of establishing exponential convergence to zero of the relative entropy for the Fokker- 
Planck equation in a divergence form. Here, we adapt the arguments of Q to the one-dimensional case with X = (a, /?). 
Define 



a'^(x) K 
Dix) ^ and B(x) = - In 



a^{x) 



^ 2F{z) 



dz. 



where K is such that f-j,{x) = e is normalized. Then the Fokker-Planck equation 14. 151 can be rewritten as 



dt dx 



D{x) 



(4.22) 



and 



We are going to show that if there exists a constant A > such that for every x G X the following inequality holds 

1 



\d{x)B"{x)^D'{x)B'{x)+^-^^ 



-D"{x) > A 



then 



for all initial densities /o with i?c(/o|/*) > ~oo. 

The method of proof of Inequality 14.241 exploits Eq. 14.231 From Eq. 14.221 it follows that 



(4.23) 



(4.24) 



(4.25) 



where 
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Let ■ip{z) = — zlogz and Hc{t) = Hc{f\f*), where f{t,x) = P*/o(a^). Then we have 

Jx 

Differentiate He with respect to t 



dt 

use Eq. 14.251 and integrate by parts to obtain 

dH,{t) _ 
dt 

Now write I{t) — — ^^^d differentiate I{t) 



ij"{e^f)De-^g^dx. 

X 



Then use Eq. 14.251 rearrange terms and integrate by parts to obtain 's'l 

^ = -2 / V"(e^/)i?e-V {\d" -D'B' -DB" dx + f Tr(y • Z)e-^da; 

dt Jx \2 41? J Jx 



where 
and 



27/>"(e^/) 2^"'(e^/) 



Since —ip"{e^f)De^^g^ is nonnegative, from Eq. 14.231 it foUows that the first integral is less then 
2A tp" {e^ f)De^^ dx = —2\I{t). Since F is a negative definite matrix and Z is a nonnegative definite ma- 
trix, we obtain Tr(y ■ Z) < which completes the proof of the inequality 

^ . --no 

Integrating from t = s to t = oo and noting that I(t) and Hc{P* folf^.) go to zero as t — *■ cxd, we arrive at 

-I{s) < 2XHc{s), 

which leads to 

dHcjs) ^ 

— > -2XHc[s), 

as 

and finally to 

Hcit) > e-'^*i/c(0). 
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1. Example of an Omstein-Uhlenbeck process 



Trying to find specific examples of cf{x) and F{x) for which one can determine the time dependent solution f{t, x) 
of Eq. 14.151 is not easy. One solution that is known is the one for an Ornstein-Uhlenbeck process. Since it is an 
Ornstein-Uhlenbeck process, which was historically developed in thinking about perturbations to the velocity of a 
Brownian particle, we denote the dependent variable by v so we have (t{v) = a a, constant, and F{v) = —jv with 
7 > 0. In this case, Eq. 14. 141 becomes 

dv 

-^ = -jv + (4.26) 

with the corresponding Fokker-Planck equation 

df _ d[-/vf] a' d'f 
dt dv 2 

The unique stationary solution is 



(4.27) 



= ..oT, = (4.28) 



J —oo 

Further, from Risken [isl Eq. 5.28] the fundamental solution k{t,v,vo) is given by 



where 



^ 1 / (« - exp(-7^)t;o)^ ) 

Ki) = |^[l-e-^^*]. (4.30) 

Note that for a given y and t the function k{t,-,y) is a Gaussian density with mean wq exp(— 7i) and variance b(t). 
Since k{t, v, vq) > for all v, vq e R, the semigroup of Markov operators 



P^h{v)=f k{t,v,vo)foivo)dvo 

Jk 

satisfies all of the conditions of Theorem|21 Thus we can assert from Theorem |21 that limt^oo Hc{P* fo\f*) = for all 
/o with Hcifolf*) > -oo. 

In the case of the Ornstein-Uhlenbeck process, the sufficient condition (|4.23|) for the exponential lower bound on 
the conditional entropy reduces to 



where 



Thus A = 7 and Eq. 14.241 becomes 



DB"{v) > A 



and B"{v) = ^. 



for all initial densities /o with i?c(/ol/*) > — oo. 

We will now show that this lower bound is optimal. Let us first calculate the conditional entropy of two Gaussian 
densities. Let 91 , (72 > and let zi, Z2 G R. Consider densities of the form 

gi{x,Zi) \ —exp{-qi{x - Zi f}, x e R, i = l,2. 

V TT 

Then 

gi{x,zi) jq^ , n2 , / ^2 

log r-=l0gW qi[x - Zi) + q2[X - Z2) ■ 

g2[X, Z2j V 92 
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Since 



gi(x, zi)x'^ dx — h ^1 and / gi(x, zi)x dx = zi, 

R 2gi 



we arrive at 



Hc{gi{-,Zl)\g2{-,Z2)) = ^log — + ^ ( 1 - — ) - 92(21 - Z2f. 

2 qi 2 \ qi/ 



(4.31) 



Now let /o be a Gaussian density of the form 



/o(w) = J— cxp{-ci(u-C2) }, 



where Ci > and C2 G R. Since 



we obtain by direct calculation using Eq. 14.291 



-P*/o(«) = / k{t,v,vo)foivo)dvo, 



R 



PVo(t;) = 



Cl 



7r(e-27t + 2ci6(t)) 
Consequently, from Eq. 14.311 with qi = 



exp 



Cl 



-27* + 2ci6(t) 



I w - C2e 



-7^ 



Cl ^ 

_„ tttj 92 = -Zi = 026^'''*, and Z2 = 0, it follows that 



i?c(P*/o|/*)-7Tlog 
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1 



2V (T^ci a2 



(4.32) 



In particular, if ci = ^ then HJfolf*) = ^ and 



Hc{P'h\U) = e-2^*i?c(/o|/*), i > 0. 



2. A Rayleigh process 

Another example for which we have an analytic form for the density f(t,x) is that of a Rayleigh process i19l page 
135-1361. In this case we have 



dv cr"^ 

— = -yv + — + cr4 e 0, 00) 

dt Zv 



and the associated Fokker-Planck equation is 



dt dv 



^"-2^1^ 



2 dv^' 



(4.33) 



(4.34) 



The unique equilibrium solution, with the proper normalization, is given by 



(4.35) 



and the fundamental solution is 



k{t,v,vo) = — T^-e 



-2ri7t 



(4.36) 
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where the L„ are the Laguerre polynomials of order n. This form of k is not particularly useful in calculations. The 
Rayleigh process is also known as a two-dimensional radial Ornstein-Uhlenbeck process [2 7^ with parameters 7 and cr, 
and as such its transition probability is given by 20] 

where hit) = — (1 — e ^''*) and Iniz) — T^^T n — rrrr is the modified Bessel function of the first kind. Since 
w ^ ^ ^k=o (2*=fc!)2 

k{t,v,Vd) > for all t,v,vo > 0, again from Theorem |21 it follows that limt^oo Hc{P* fo\f*) — for all /q with 

HcUo\I*) > -oo- 

In the case of the Rayleigh process, the sufficient Condition l4.23l for the exponential lower bound on the conditional 
entropy reduces to 



where 



Thus A = 7 and Eq. 14.241 becomes 



DB"{v) > A 



D = ^ and B"{v)^'^ + \. 



for all initial densities /q with Hc{fo\f*) > —00. 



B. The multidimensional case 



In this section first we turn our attention to the existence of a stationary density in the case of multidimensional 
diffusion when the matrix E(a;) is nonsingular at every point x. This case is much more involved and does not yield 
simple necessary and sufficient conditions for the existence of a stationary density. However, when we consider two 
specific examples much more can be said. 

Let us assume that F and a are of class and respectively and that 

d 

^ a,y (a;)AAi > 0, xeR'^ and A G R"* - {0}. 

Under these assumptions, the so-called Liapunov method implies that if there exists a nonnegative function V of 
class on R'^ and a positive constant r > such that 

sup L-V{x) < 0, (4.38) 

|a:| >r 

where the operator 

then there is a unique stationary density and f*{x) > for all x. Again, from Theorem|211imi^oc Hc{P^ fo\f*) = 
for all /o with Hc{fvi\f*) > —00. If we simply take V{x) — \x\'^, then Condition 14. 381 becomes 



sup ^ aij{x) + 2'^XiFi{x) < 0. 



|a:|>r 



One can also write an integral test |2j| which reduces to Condition 14. 2 II in the one-dimensional case. Let 

d 

\x\^ - ' \x\~ 



1 1 

Ai{x) = |— |2a;'^a(a;)x = j-r^ ^ ai-j{x)xiXj, 
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and 



^2(2;) = Tr(a(x)) + 2(x, F{x)) = ^a,,(a;) +2^ 



^2(2;) — Ai(x) 

p[r) = sup and Q!(r) = mf Ai(a;j. 

\x\=r Ai{x) \x\=r 



Now define 



Finally, for r > 1, let 



s 

and assume that there exists Tq > such that the following two conditions hold: 

/•OO /"OO -1 

/ e-^'^^'Ur^oo and / — -e"'^'') dr < 00. (4.40) 

Jro J TO 

We claim that Condition I4.4UI implies Condition 14.381 To sec this, for r > tq define 

J{r) = / e-^(^) / —e^^iUqds, 
Ji Js Oi{q) 

and take V to be of class on R'' such that V(x) — J{\x\) for \x\ > tq. It is easy to check that 

2L~Vix) = A,ix)J"i\x\) + :l^(A2ix) - A,ix)) 

\x\ 

and that 2L-V{x) < -1 for |a;| > tq. 

We now discuss the speed of convergence of the conditional entropy. 8] provide a general sufficient condition for 
such convergence in the case of multidimensional diffusions that the Fokker-Planck equation can be written in a 
divergence form: 

^^dW{D{x)iVf + fVB{x))), (4.41) 
where D(x) is locally uniformly positive and B(x) is a real- valued function such that the stationary solution 

is a density. For simplicity assume that D{x) = £) is a constant. Then the Fokker-Planck eauation l4. 411 describes the 
evolution of densities for the system 

dx{t) = -DVB{x{t))dt + V2DIdw{t), (4.42) 
where / is the identity matrix. If there is A > such that 

^{.dx,dxj\^^^^^ ^ - ^"^'^^^ 

then we have 

Hc{P'fo\f*) > e-^^'H,{fo\U). (4.44) 
for all initial densities /o with i?c(/o|/*) > —00. 
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C. Multidimensional Ornstein-Uhlenbeck process 

Consider the multidimensional Ornstein-Uhlenbeck process 

rlnf 

— = Fx + (4.45) 

where F is a d x d matrix, Y. is a d x d matrix an ^ is d dimensional vector. The formal solution to Eq. 14.451 is given 
by 

xit) = e*-^x(0) + /* e(*-")^S dwit) (4.46) 
Jo 

where e*^ — X^riLo flT^" ^® fundamental solution to X{t) = FXit) with X{Qi) = I, and w{t) is the standard 
d-dimcnsional Wiener process. From the properties of stochastic integrals it follows that 

t 



Jo 

has mean and covariance 

ft 

R{t) = ET^{t)ri{tf ^ e^^SS^e^^^ds, 
Jo 

where is the transpose of the matrix B. The matrix R{t) is nonnegative definite but not necessarily positive 
definite. We follow the presentation of and For each t > the matrix R(t) has constant rank equal to the 
dimension of the space 

[F, S] := {F'-iEe, : j = 1, . . . , rf, = (5,i, . . . , S^^f}. 

If TO = ranki?(t) then d — m coordinates of the process -qit) are equal to and the remaining m coordinates constitute 
an TO-dimcnsional Gaussian process. Thus if to < d there is no stationary density. If ranki?(t) = d then the transition 
probability function of x{t) is given by the Gaussian density 

Kt,x.x,) - (2^)^/2(detfl(^))i/2 ^^P{-^(^ - e'^x,)^R{t)-\x-e'^x,)}, (4.47) 

where R{t)~^ is the inverse matrix of R{t). An invariant density /* exists if and only if all eigenvalues of F have 
negative real parts, and in this case the unique stationary density has the form 

= (2.)^/^(deti^.)V^ '"4"^""''-'4 ' ^'-''^ 
where is a positive definite matrix given by 

R, = / e^^SE^e^^'ds, 
Jo 

and is a unique symmetric matrix satisfying 

FR, + = -SS^. (4.49) 

We conclude that if [F, S] contains d linearly independent vectors and all eigenvalues of F have negative real parts, 
then from Theorem|2]it follows that limt^oo Hc{P* fo\f*) = for all fo with -ffc(/o|/*) > — oo. 

To simplify the following we first recall some properties of multivariate Gaussian distributions. Let Qi, Q2 be 
positive definite matrices and let 

1 ^1' ^T^-l, 



Then 
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gi{x,zi) 1 det(52 1 



log T = t: log 



l{x - zifQ^\x ~ zi) + \{x ~ Z2)^Q^'(a; - z^). (4.50) 



gi(x,z2) 2 ^detQi 2' -^i v ^, 

Since J gi{x, 0)xx'^ dx — Qi, we have 

gi{x, zi)xx'^ dx = Qi + ziz'[ and / gi{x, zi)x dx — zi. 



Note also that z'^Qz can be written with the help of the trace of a matrix as Tr[(5z2;"^] for any matrix Q and any 
vector z. Consequently, 

HM:^i)\92{;Z2)) = ^log^^ + \n{Qi' - Q2')Qi] - \^AQ2\^i - ^2){zi - z^n (4.51) 
Now let /o be a Gaussian density of the form 

/o(.) - (2.)./.(dety(0))i/2 {^(^ - - -(0))} (4.52) 

where V{Q) is a positive definite matrix and to(0) G R"*. From Eq. 14.461 it follows that x{t) is Gaussian with the 
following mean vector m{t) — e*^TO(0) and covariance matrix V{t) — e^^V {Q)e*^ + R{t). Hence 

P7o(x) = y^,^y/2 - ^it)fVit)-\x - Mt)) 

Consequently, from Eq. 14.511 with Qi = V{t), z\ — ni{t), Q2 — R*, and Z2 = 0, we obtain 

i?c(P*/o|/*) - I log + iTr(I - i?; V(i)) - lTr{R:'mit)mitf). (4.53) 

In particular, if V{0) = i?*, then V{t) = i?, and 

HaiP'MU) - ~^TiiR:'mit)m{tf) (4.54) 

for alH > and every /g of the form given by Eq. 14.521 

As a specific example of the multidimensional Ornstein-Uhlenbeck process, to which Equation 14.441 and 14.541 can 

2A 

be applied, consider the case when S = crl and is a diagonal matrix, F = —XI with A > 0. Then R^ = -^l and 
f*{x) = e^-^(^\ where 

B{x) = ilog((27r)^deti?,) + ^x'^x. 



= AT. 



Thus Condition 14.431 becomes 

2 \dx^dxj y ^^^^^^ ^ 

Since e*^ = e""^*!, we conclude from Eg iiat ion 14 . 541 that Hc{P*fo\f*) = e^^^^Hc{fo\f*), so the estimate in Equation 
14.441 is optimal. We will show in the next section that in the case of a non-invertible matrix E a slower speed of 
convergence might occur. 

To obtain a lower bound on the conditional entropy for the case of a not necessarily invertible matrix E and a 
general /o, we make use of the following inequality, proved in the appendix. 



Hc{P'fo\f*)> / / /o(2/i)/*(y2)i?c(fc(t,-,yi)|fc(t,-,2/2))dyidy2. 



(4.55) 
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From Eq. 14.511 with Qi = R{t), Q2 = R{t), z\ = e^^yi, and Z2 = e^^yi it follows that 

H,{k{t, ; y,)\k{t, ; V2)) = -iTr[i?(t)-ie*^(yi - y^Kvi - ^2)^6*^"]. 
Since Tr[i?(t)^^e*^yy^e*^^] < ||?/|p||e*^^i?(t)~^e'^|| for any y by the Schwartz inequality, we obtain from Eq. 14.551 

i?c(P*/i|/*)>-^l|e*^inii?(i)-^ll / J \\yi-y2\\'Myi)My2)dyidy2. (4.56) 

Finally, observe that the norm of R{t)^^ is bounded, because R{t)~^ converges to R^^ as t ^ oo. Thus for sufficiently 
large t we have 

H,{P' fi\f*)>~\\e'^\\'\\R:'\\ J J \\yi~y2\\'fo{yi)My2)dy^dy2. (4.57) 

1. Harmonic oscillator 

Consider the second order system 

dx 

with constant positive coefficients m, 7 and a. Introduce the velocity w = — as a new variable. Then Eq. 14.581 is 

at 

equivalent to the system 



dx 
Itt 



(4.59) 



dV r, 

m— = -"fv - uj X + a^., (4.60) 
dt 



and the corresponding Fokker-Planck equation is 

df ^ d[vf] ^ 1 d[{jv + u;^x)f] ^ 8^ 
dt dx m dv 2m? dv"^ 

We can use the results of Section HV Cl in the two dimensional setting with 

/ 1 \ /o \ 

F=\ uj'^ 7 and E = Q . 

V / \ m J 

Since 

VI — J I 

the transition density function is given by Eq. 14.471 The eigenvalues of F are equal to 



\ m / \ m 



—7 + -1/72 — 4mti)2 —7 — ^72 — 4ma;2 
= 2^ ""'^ ' 

and are either negative real numbers when 7^ > 4mw^ or complex numbers with negative real parts when 7^ < Amuj'^. 
Thus the stationary density is given by Eq. 14.481 As is easily seen i?, , being a solution to Eq. 14.491 is given by 
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The inverse of the matrix i?* is 



* cr2 i m 



and the unique stationary density becomes 

As in Section HV CI we conclude that Hmt^oo Hc{P*^fo\f*) = for all fo with iJc(/o|/*) > — oo. 

The bound on the temporal convergence of Hc{P* f[)\f*) to zero, as given by Eauation l4.57l is determined by ||e'^|p. 
Thus we are going to calculate ||e*''^|p and to see the nature of the general formula (|4.54(l for the conditional entropy 
in the case of the harmonic oscillator. First, consider the case when Ai = A2, that is 7^ = 4mw^, and set 

A = — L. 
2m 

Then we have 

1 A , tF _ xtf i-^t t 



^=(,-A^ 2Aj e-=e-(^_^2, ^ + 

and 

||e*^||2 = e2^*(l + (3A2 + l)i2). 
If we take m(0) = (1, A)"'" then m{t) = e'^*m(0) and Eq. 14. 541 becomes 

But, if we take to(0) = (1, 0)"^, then m{t) = e^*(l - Xt, -A^i)^ and 

Hc{P'k\U) = e''* ((1 - \tf + H,{h\U). 

Now consider the case when Ai 7^ A2 are real and define, for i > 0, 

A2e^i* - Aie^2* , e^^* - e^i* 
ci(t) = and C2{t) = — — . 

A2 — Ai A2 — Ai 

Then 



Cl{t) C2{t) 

c[{t) c'^it) 



and 

If we take m(0) = (1, Xi)'^ , i = 1, 2 in Eauation l4.54l then m{t) = p/'*to(0) and 
Finally, consider the case of complex Ai, A2 and let 



7 J « y/inw?^^ 

a — ~ - — and p = — . 

2m 2™ 

Then 

tF _ e"' / /3cos(/3t) -asm{(3t) sin(/3t) 



/3 \^ -(a^ +/32)sin(/3i) /3 cos(/3i) + a sin(/3t), 
and we have 

||e*^||2 = ^ (2/32cos2(/3i) + {2a^ + {c? + i?f + l)sin2(/3i)) 
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2. Colored noise 

Consider the system 



— = —ax + rj (4.61) 
at 



where a > and 77 is the one dimensional Ornstein-Uhlenbeck process with parameters 7, cr > 

- = -7r; + at 

Then Eq. I4.61l is equivalent to the system 



da; 

dv 
'dt 



-ax + V (4.62) 
-7f + cr£., (4.63) 



and the corresponding Fokker- Planck equation is 

9/ ^ d[{ax - v)f] ^ d[-fvf] ^ a2 a^f 
dt dx dv 2 dv"^ 

We can use the results of Section FlV Cl in the two dimensional setting with 



Observe that 

The eigenvalues of F are given by 



) ( ^ ) ' ( -7^ 



Ai — —a and A2 = —7, 

and are evidently negative. Finally, 

27(0 + 7) \la + -f 

and 



* cr-^ I — 1 — 



a 



Thus the unique stationary density is equal to 



y^ja + 7) r Q + 7 / , . «^ 2 o ^ 2N 

7*(x, w) = 5 exp < — [aia + p)x — laxv + v I 



As in Section ITV CI we conclude that hmj^oo Hc{P* fo\f*) = for all /o with Hc{fo\f*) > ~oo. 

Finally, we are going to calculate ||e'^|p to see how the general formula 14.541 for the conditional entropy bound 
looks in this special case. First consider the case when a 7^ 7. The fundamental matrix is given by 



"7 f-0 
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and 



|e*^||2 = e-2^* + e-2«*. 



If we take m(0) — (1, 0)'^, in Eauation l4.54l then m{t) — e "*m(0) and 

Hc{P'h\f*) = e-^"'H4fo\f.). 
Similarly, for m(0) — {I, a — 7)^ we have m{t) — e~'^*m{0) and 

HciP'Mf*) = e~'^*i^c(/o|/*)■ 
Now consider the case when a = 7. We have 

^f^J ) and I |e*^| |2 = e'^^* + e'^"*. 



Equation 01 becomes for m(0) = (0, 1)^ and m{t) = e"''*(i, 1)^ 

Hc{P'h\U) - e-27*(2f t2 _ ^ l)i7c(/o|/*)- 



MARKOVIAN DICHOTOMOUS NOISE 



Another example where we can use our results is the case of dichotomous noise [l^ Section 8]. The state space of 
the Markovian dichotomous noise ^(i) consists of two states {c+, c_} and is characterized by a transition probability 
from the state c+ to c_ in the small time interval Ai given by a At + o(Ai), and from the state c_ to c+ given by 
/3A< + o(At), where a, /? > 0. It has the correlation function 



[a + pY 



A system subject to this type of noise is described by the following equation 

^ = F(x)+a(x)e 

The pair {x{t),S,(t)) is Markovian and writing a(x, c±) — F{x) + c±(t{x) we arrive at 

dx{t) = a{x{t),^{t))dt. 

The process ^{t) determines which deterministic system 

dx+ , . dx_ 
-^=a{x,c+) or —^^a{x,c^) 

to choose. We assume that given the initial condition x±{0) — x, each of these equations has a solution x+{t) and 
X-{t), respectively, defined and finite for all t > 0, and we write tt*^_{x) — x+{t) and Tr*_{x) = x^{t). Furthermore, we 
assume that there is a minimal open set X such that Tr^{X) C X for all t > 0. Let B{X x {c+,c_}) be the sigma 
algebra of Borel subsets of X x {c+, c_} and let ^ be the product measure on B{X x {c+, c_}) which, on every set of 
the form B x {c±}, is equal to the length of the set B. The norm of any element / of the space L^(X x {c+, c_}, /i) 
is equal to 



\f+{x)\dx+ / \f-ix)\dx, 
X Jx 

and we now define a semigroup of Markov operators on this space which describes the temporal evolution of the 
densities of the process {x{t),£^{t)). 

The evolution equation for the densities f±{t,x) — f{t,x,c±) of the process {x{t),£^{t)) is of the form 

df+ d[aix,c+)f+] r ,nf 



dt dx 

df- d[a(x,c^)f^ 



dt dx 



+ (5.2) 
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Formally writing / = (/-|_, /-)^, we arrive at the following equation 



where 



^^Af + Mf (5.3) 



d[a{x,c+)f^ 



M = { " ^« ) and Af=\ p,, f 



dx 

Then the operator A generates a semigroup of Markov operators T* on L^{X x {c+, c-}, /i). Since M is a bounded 
operator, A + M also generates a semigroup of Markov operators P*, and the following holds 



Jo 



for every /g £ L^{X x {c+, c_}, /x). The semigroup P* gives the generalized solution f{t, •) of Eq. 15. 31 with the initial 
condition /(O, •) — /q. Using the results of Pichor and Rudnicki [l^, Proposition 2] we infer that if there is an xq G X 
such that a(a;o,c+) 7^ a(xo,c_), then the semigroup P* is partially integral, and if X is the minimal set such that 
7r±(-'^) ^ then this semigroup can have at most one stationary density. Consequently, if a stationary density 
exists, Theorem 12 implies that 

lim H,{P'fo\U) = (5.4) 
for all /o with Hdfolf^) > —oo, where the conditional entropy for on L^{X x {c+, c_}, fi) is equal to 



Hc{P'k\h)^-l P'f{x,c+)log^l^p^dx^ [ P^f{x,c^)log^^ip^dx. 

Jx c+j Jx j^(x,c-) 

by Eq. O 

The density of the state variable x{t) is an element of the space L^{X, m), where m is the Lebesque measure on X. 
It is given by 

p{t, x) = P*/o(x, c+) + PVo(a;, c_). (5.5) 
Thus the stationary density of P* gives us the stationary density of x{t) 

p*{x) = /,(a;,c+) +/,(a;,c_). 

Since 

HMt)\p.) = - f p{t,x)\ogP^dx > i?c(P*/o|/*), (5.6) 

we conclude that 

lim i/,(p(t)|p,) = 0. (5.7) 

i — 'oo 

For a general one-dimensional system with dichotomous noise, one can derive a formula for the stationary density. 
For the sake of clarity, we follow |2^] and restrict our discussion to the case of symmetric dichotomous noise where 

C-i- = — c_ = c and a = P, 

and 

a{x, c+)a{x, c_) < 0, 

where the zeros of a{x, c+)a(x, c_) are the boundaries of X. Then the unique stationary density of P* is given by 
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f*{x,c±) ^ K—-^ — -yexpi-a / ( — \ + -f- — V f" ' 

\a[x,c±)\ I J \a{z,c+) a(z,c_)/ 

where K is a normalizing constant. 

As a specific example, consider the linear dichotomous flow 

dx ^ 

where 7 > 0. Then c± — ±c and a(x, c±) = —7a; ± c for x G R. Thus 

7 

and the state space is 

X 

The stationary density in this case is given by 

f*{x,±c) 



c c 
7' 7 



„2 \ a/7-1 



„2 



Tlx + C \ 

where the normalizing constant is equal to 



K- ^ I' 



-2q/7 



and B is the beta function. Since a{x,+c) > > a{x,—c) for all x E X, Conditions 15.41 and 15.71 hold, where the 
stationary density of the state variable x{t) is equal to 

p*(x) ^ — — \ — ~^ 



VI. DISCUSSION 



Here we have examined the evolution of the conditional (or Kullback-Leibler or relative) entropy to a maximum in 
stochastic systems. We were motivated by a desire to understand the role of noise in the evolution of the conditional 
entropy to a maximum since in invertible systems (e.g. measure preserving systems of differential equations or 
invertible maps) the conditional entropy is fixed at the value with which the system is prepared. However the 
addition of noise can reverse this property and lead to an evolution of the conditional entropy to a maximum value 
of zero. We have made concrete calculations to see how the entropy converges, and shown that it is monotone and at 
least exponential in several situations. 

Specifically, in Section^ we introduced the dynamic concept of asymptotic stability and the notion of conditional 
entropy, and gave two main results connecting the convergence of the conditional entropy with asymptotic stability 
(Theorem^ , and the existence of unique stationary densities with the convergence of the conditional entropy (Theorem 
121). In Section IlIII we illustrated the well known fact that asymptotic stability is a property that cannot be found in 
a deterministic system of ordinary differential equations, and consequently that the conditional entropy cannot not 
have time dependent behavours character for this type of invertible dynamics. Section IIVI introduced a stochastic 
extension to this invertible and constant entropy situation in which a system of ordinary equations is perturbed by 
Gaussian white noise (thus becoming non-invertible) . We were able to give some general results showing that in this 
stochastic case asymptotic stability holds. Then in Section FlV Al we considered specific one dimensional examples, 
and showed that the conditional entropy convergence to zero is monotone and at least exponential, considering the 
specific examples of an Ornstein-Uhlenbeck process in Section IIV A II and a Rayleigh process in Section IIV A 21 We 
went on to look at multidimensional stochastic systems in Section IIVBI showing that the exponential convergence of 
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the entropy still holds. Specific examples of these higher dimensional situations were considered within the context 
of a two dimensional Ornstein-Uhlenbeck process in Section II V CI with specific examples of a stochastically perturbed 
harmonic oscillator (Section IIV C 1(1 and colored noise (Section IIV C 2|l as examples. In the specific cases of the 
Ornstein-Uhlenbeck and Rayleigh processes as well as the stochastically perturbed harmonic oscillator and colored 
noise examples, we have the rather surprising result that the rate of convergence of the entropy to zero is independent 
of the noise amplitude cr as long as cr > 0. The final SectionlVl applied our general results to the problem of conditional 
entropy convergence in the presence of dichotomous noise. 
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APPENDIX 
1. Proof of inequality [531 

Jensen's inequality states that when is a normalized measure and is a concave function then 

(^J gizMdz)^ > J 4>{9{z))y{dz). 

Since log is a concave function, Jensen's inequality implies 

//log-M(rfa;) < log / f-^{dx). 
J 9 J 9 



Now note that 



Thus 



fj,{dx) 



9^iidx) ~ 1. 



Hc{f\9) < log 



1 + J (^-l) 9Kdx) 



< 



9 



1 9^l{dx). 



2. Proof of inequality 14.551 

To see how easy it is to derive this estimate, first write 



Hc{P'h\P'h) = - j P'h{x)\og 



dx 



P'f2{x) 

^{P'h{x),P'h{x)l 

where ip(u,v) = v\og{u/v) is convex. From the properties of convex functions there always exist sequences of real 
numbers {a„} and {6„} such that 



(/?(«, v) — sup{a„u + bnV : n e N}. 



Remembering (|4.11ll we can then write 
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a„P* fiix) + bnP* f2{x) ^ a„ / k{t,x,yi)fi{yi)dyi+bn J k(t,x,y2)f2{V2)dy2 

{ank{t, X, yi)fi{yi)f2{y2) + bnk{t, x, y2)h{yi)h{y2)} dyidy2 
fi {yi).f2{y2) {ank{t, x,yi) + bnk{t, x, ^2)} dyidy2 
< I I fiiyi)f2{y2)^ik{t,x,yi),k{t,x,y2))dyidy2. 



Thus 



sup {a^P' f lix) + bnP' f 2ix)} =^{P'h{x), P'f2ix)) = 
neN 



fiiyi)f2{y2Mk{t, X, yi), k(t, X, y2))dyidy2, 



H,{P'h{x)\P'j2{x)) > /l(2/l)/2(2/2) 



ip{k{t,x,yi),k{t,x,y2))dx 



dyidy2 



.fi{yi).f2{y2)Hc{k{t, X, yi), k{t, X, y2))dyidy2. 
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